Arabic Hate Speech Detection Using Deep Recurrent Neural Networks
نویسندگان
چکیده
With the vast number of comments posted daily on social media and other platforms, manually monitoring internet activity for possible national security risks or cyberbullying is an impossible task. However, with recent advances in machine learning (ML), automatic such posts becomes feasible. There still issue privacy internet; however, this study, only technical aspects designing automated system that could monitor detect hate speech Arabic language were targeted, which many companies, as Facebook, Twitter, others, use to prevent cyberbullying. For task, a unique dataset consisting 4203 classified into seven categories, including content against religion, racist content, gender equality, violent offensive insulting/bullying normal positive comments, negative was designed. The extensively preprocessed labeled, its features extracted. In addition, deep recurrent neural networks (RNNs) proposed classification detection speech. RNN architecture, called DRNN-2, consisted 10 layers 32 batch sizes 50 iterations Another model five hidden layers, DRNN-1, used binary classification. Using models, recognition rate 99.73% achieved classification, 95.38% three classes 84.14% comments. This accuracy high complex language, Arabic, different classes. higher than similar methods reported literature, whether three-class seven-class discussed literature review section.
منابع مشابه
Audio Visual Speech Recognition Using Deep Recurrent Neural Networks
In this work, we propose a training algorithm for an audiovisual automatic speech recognition (AV-ASR) system using deep recurrent neural network (RNN).First, we train a deep RNN acoustic model with a Connectionist Temporal Classification (CTC) objective function. The frame labels obtained from the acoustic model are then used to perform a non-linear dimensionality reduction of the visual featu...
متن کاملUsing Convolutional Neural Networks to Classify Hate-Speech
The paper introduces a deep learningbased Twitter hate-speech text classification system. The classifier assigns each tweet to one of four predefined categories: racism, sexism, both (racism and sexism) and non-hate-speech. Four Convolutional Neural Network models were trained on resp. character 4-grams, word vectors based on semantic information built using word2vec, randomly generated word ve...
متن کاملSpeech activity detection on youtube using deep neural networks
Speech activity detection (SAD) is an important first step in speech processing. Commonly used methods (e.g., frame-level classification using gaussian mixture models (GMMs)) work well under stationary noise conditions, but do not generalize well to domains such as YouTube, where videos may exhibit a diverse range of environmental conditions. One solution is to augment the conventional cepstral...
متن کاملDNA Steganalysis Using Deep Recurrent Neural Networks
Recent advances of next generation sequencing technologies have facilitated deoxyribonucleic acid (DNA) to be used as a novel covert channel in steganography. There exist various methods in other domains to detect hidden messages in conventional covert channels, however, they have not been applied to DNA steganography. The current most common detection schemes, frequency analysis-based methods,...
متن کاملSpeech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks
Quality of text-to-speech voices built from noisy recordings is diminished. In order to improve it we propose the use of a recurrent neural network to enhance acoustic parameters prior to training. We trained a deep recurrent neural network using a parallel database of noisy and clean acoustics parameters as input and output of the network. The database consisted of multiple speakers and divers...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2022
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app12126010